Ultra low bit-rate speech coding based on unit-selection with joint spectral-residual quantization: no transmission of any residual information
نویسندگان
چکیده
A recent trend in ultra low bit-rate speech coding is based on segment quantization by unit-selection principle using large continuous codebooks as a unit database. We show that use of such large unit databases allows speech to be reconstructed at the decoder by using the best unit’s residual itself (in the unit database), thereby obviating the need to transmit any side information about the residual of the input speech. For this, it becomes necessary to jointly quantize the spectral and residual information at the encoder during unit selection, and we propose various composite measures for such a joint spectral-residual quantization within a unit-selection algorithm proposed earlier. We realize ultra low bit-rate speaker-dependent speech coding at an overall rate of 250 bits/sec using unit database sizes of 19 bits/unit (524288 phone-like units or about 6 hours of speech) with spectral distortions less than 2.5 dB that retains intelligibility, naturalness, prosody and speaker-identity.
منابع مشابه
Ultra Low Bit-Rate Coders
In this chapter, we present the definition and principles of ultra-low bit-rate coders. Here the emphasis is to point to the fact that this class of coders is typically the ‘vocoders’, which are ‘parametric’ coders that are essentially linear-prediction (LP) based vocoders. This is in contrast to the ‘waveform’ coders, which operate at the higher bit-rates. Among the various frameworks employed...
متن کاملAn unified unit-selection framework for
We propose a unified framework for segment quantization of speech at ultra low bit-rates of 150 bits/sec based on unit-selection principle using a modified one-pass dynamic programming algorithm. The algorithm handles both fixedand variablelength units in a unified manner, thereby providing a generalization over two existing unit selection methods, which deal with ‘single-frame’ and ‘segmental’...
متن کاملLow Bit Rate Speech Coding via TCVRQ
We present a new Trellis Coded Vector Residual Quantizer (TCVRQ) that combines trellis coding and vector residual quantization. We introduce new methods for computing quantization levels and experimentally analyze the performances of our TCVRQ in the case of speech coding at very low bit rates. The results obtained show that transparent quantization of Linear Prediction (LP) parameters can be p...
متن کاملQuantization and Reconstruction of Sources with Memory
A fundamental problem in telecommunications is the reliable transmission of a source over a noisy channel. As an important result of the Shannon’s celebrated paper [1], the problem can be theoretically separated, without loss of optimality, into two parts: source coding and channel coding. However, in practise, due to the strict design constraints, such as the limitations on complexity of the s...
متن کاملLow bit-rate speech coding using quantization of variable length segments
This paper describes a new segmentation and quantization technique for low bit-rate speech coders. Bit-rate reduction is achieved by combining segmentation and quantization, where a segment consists of one or more adjacent frames. The algorithm for selecting and quantizing segments from a pre-determined number of frames extends the frame-based trellis techniques. It models the input speech as a...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2009